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Open Source won. 


Open Source won. 
Why? 


Why? 


/* Copyright 1984 Massachusetts Institute of Technology 


Permission to use, copy, modify, and distribute this program 

for any purpose and without fee is hereby granted, provided 

that this copyright and permission notice appear on all copies 

and supporting documentation, the name of M.I.T. not be used 

in advertising or publicity pertaining to distribution of the 
program without specific prior permission, and notice be given 

in supporting documentation that copying and distribution is 

by permission of M.I.T. М.І.Т. makes no representations about 

the suitability of this software for any purpose. It is pro- 
vided "as is" without express or implied warranty. x / 


Figure 3. Copyright notice for C language programs first used in the February 1, 1984 distribution of PC/IP 
and the distributions later that year of the C-Gateway and EGP for the LSI-11. 


[1] https://web.mit.edu/Saltzer/www/publications/MITLicense.pdf 


Winning -> new challenges 
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Winning -> new challenges 


Richard Blumenthal & @SenBlumenthal - Jun 6 

Meta released its advanced Al model, LLaMA, w/seemingly little 
consideration & safeguards against misuse—a real risk of fraud, privacy 
intrusions & cybercrime. Sen. Hawley & | are writing to Meta on the steps 
being taken to assess & prevent the abuse of LLaMA & other Al models. 


Stefano Maffulli 


—— 


The US Senate subcommittee оп Privacy Technology апа Law 
asked Meta questions about their “leaked” #Al model LLaMa. 


Forget for a moment that this is Meta: | fear that if these 
questions were asked to any #OpenSource developer wed all be 
shaking in fear for the whole ecosystem. 


New challenges -> OPA 


Global collaboration 
Open Policy Local education 
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Bringing non-profit organizations together 


What The Open Policy Alliance is designed to bring non- 
profit organizations together to participate in 
educating and informing US public policy decisions 
related to Open Source software, content, 
research, and education. 


New challenges: Al | 


What is the definition of Open Source Al that serves the goals of Open Source: 


e Autonomy 
e Transparency 
e Collaborative Improvement 


Is the right focus still “preferred form of making modifications to the work?" 


New challenges: 
Defining Open Source Al 


Four parts to an Al System 


e Model architecture 

e Data: both in its raw state and 
prepared datasets for training and 
testing 

e  Learnable parameters: weights and 
biases 

e Software: for training and testing, 
inference and analysis 


hidden layer 
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Data in “open” ML models 


Pro: maximum ability 
to train open models 
on private data. 


Pro: description of 
data (e.g., hospital 
patient demographics) 
provides some 
transparency but 
allows open model 


Pro: queries permit 
even more 
understanding and 
illumination of bias 
(e.g., ratio of male to 
female medical data) 


Pro: transparent, 
though rights to 
redistribute/download 
are vague. 


Con: no transparency 
into the data other 
than weights analysis 


Con: no ability to 
train open models on 


training on private allows open model 
or input/output OA dita gonp кен н private private data, leaving 
dida potential insights and 


Con: still no full collaborations (e.g. 
transparency; by Con: still no full medical) out of the 
describing data transparency definition. 

adding selection bias 


m to what you describe; 


No data info Data source described | Partial data Data public 
available 
(J e e ® @ 


A spectrum 


Data under open license 


Pro: transparent, able to 
be redistributed to 
rebuild/retrain model with 
different architecture. 


Con: no ability to 
leverage public 
information; no ability to 
train on private data. 


increasing surface 
area of concern. 


New challenges: Al | 


Looking ahead: Big questions for Open Source Al as it challenges closed Al 


Open washing — publicly licensed models allow community optimization but keep upside. 
Will there be equal access to (public) data for training? 

What is the right scope for exceptions to regulations for Open Source Al development? 

How will communities build, govern, and maintain different parts “open source” Al systems? 
Will there be a coherent “upstream” for open source Al systems? 


What you can do. 


Join OSI: https://opensource.org/join 
Join our deep dive conversations: https://opensource.org/deepdive/ 
Join the OPA: https://opensource.org/programs/open-policy-alliance/ 


Donate (or have your company donate) to OSI: https://opensource.org/donate 


